The Corpus of Czech Verse

نویسندگان
چکیده

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Generating Czech Iambic Verse

In the paper, we describe an algorithm for generating Czech iambic verse and its implementation on a computer. It is a continuation of the work first done in 1972 [6], in which a program generating Czech iambic verse had been developed, written in the programming language Algol-Genius and run on the mainframe SAAB D21 with interesting results. Here, we present a new experiment which is a follow...

متن کامل

The Czech Broadcast Conversation Corpus

This paper presents the final version of the Czech Broadcast Conversation Corpus that will shortly be released at the Linguistic Data Consortium (LDC). The corpus contains 72 recordings of a radio discussion program, which yields about 33 hours of transcribed conversational speech from 128 speakers. The release does not only include verbatim transcripts and speaker information, but also structu...

متن کامل

Building Big Czech Corpus

This paper describes creating of a big Czech corpus frommany Czech corpora kept on the NLP Centre server. It describes new tools developed for this purpose, difficulties which may come up and a way how solve them.

متن کامل

The Nijmegen Corpus of Casual Czech

This article introduces a new speech corpus, the Nijmegen Corpus of Casual Czech (NCCCz), which contains more than 30 hours of high-quality recordings of casual conversations in Common Czech, among ten groups of three male and ten groups of three female friends. All speakers were native speakers of Czech, raised in Prague or in the region of Central Bohemia, and were between 19 and 26 years old...

متن کامل

Analysis of Czech Web 1T 5-Gram Corpus and Its Comparison with Czech National Corpus Data

In this paper, newly issued Czech Web 1T 5-grams corpus created by Google and LDC is analysed and compared with reference n-gram corpus obtained from Czech National Corpus. Original 5-grams from both corpora were post-processed and statistical trigram language models of various vocabulary sizes and parameters were created. The comparison of various corpus statistics such as unique and total wor...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Studia Metrica et Poetica

سال: 2015

ISSN: 2346-691X,2346-6901

DOI: 10.12697/smp.2015.2.1.05